Online gradient descent
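Like (offline) gradient descent, but instead of the gradient of a single fixed objective, $\nabla f(x)$, we use the gradient of the cost revealed in the current round, $\nabla f_i(x^{(i)})$.
$x^* = \arg\min_x \sum_{i=1}^T f_i(x)$ (the offline optimum)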
Assume:
- $f_1, \ldots, f_T$ are all convex
- Each is G-Lipschitz: for all $x$, $i$, $\|\nabla f_i(x)\|_2 \le G$
- starting radius: $\|x^* - x^{(1)}\|_2 \le R$
Online Gradient descent:
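- Choose $x^{(1)}$ and $\eta = \frac{R}{G\sqrt{T}}$.
- For $i = 1, \ldots, T$:
  - Play $x^{(i)}$
  - Observe $f_i$ and incur cost $f_i(x^{(i)})$
  - Update $x^{(i+1)} = x^{(i)} - \eta \nabla f_i(x^{(i)})$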
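The loop above can be sketched in a few lines of Python. This is a minimal illustration rather than a reference implementation: it assumes the standard step size $\eta = R/(G\sqrt{T})$ and represents each cost only through a (sub)gradient callback. The names `online_gradient_descent` and `abs_grad` are hypothetical, and the example costs $f_i(x) = |x - z|$ are chosen only because they are convex and 1-Lipschitz.

```python
import math
from typing import Callable, List, Sequence

Vector = List[float]


def online_gradient_descent(
    grads: Sequence[Callable[[Vector], Vector]],
    x1: Vector,
    R: float,
    G: float,
) -> List[Vector]:
    """Return the points x^(1), ..., x^(T) played by online gradient descent.

    grads[i] returns a (sub)gradient of the i-th cost at a point. It is only
    called after the point for that round has been fixed, which mimics the
    cost being revealed after we play.
    """
    T = len(grads)
    if T == 0:
        return []
    eta = R / (G * math.sqrt(T))  # assumed step size eta = R / (G * sqrt(T))
    x = list(x1)
    played = []
    for grad in grads:
        played.append(list(x))  # play x^(i)
        g = grad(x)             # observe f_i through its gradient at x^(i)
        x = [xj - eta * gj for xj, gj in zip(x, g)]  # gradient step
    return played


def abs_grad(z: float) -> Callable[[Vector], Vector]:
    """Subgradient of the illustrative 1-D cost f(x) = |x - z| (G = 1)."""
    def grad(x: Vector) -> Vector:
        d = x[0] - z
        return [float((d > 0) - (d < 0))]  # sign of d, with 0 at the kink
    return grad


# Four rounds with the same cost |x - 1|, starting at 0, so R = 1 and G = 1.
played = online_gradient_descent([abs_grad(1.0)] * 4, [0.0], R=1.0, G=1.0)
```

Here $\eta = 1/2$, so the iterates move from 0 to 0.5 to 1 and then stay at the minimizer, where the subgradient returned by `abs_grad` is zero.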
Online gradient descent analysis
Online gradient descent regret bound
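For reference, the standard statement and the key step of the usual argument can be written as follows. This is a sketch under assumed notation, not necessarily the notes' own derivation: the $f_i$ are convex with $\|\nabla f_i(x)\|_2 \le G$, $x^*$ is the offline optimum with $\|x^* - x^{(1)}\|_2 \le R$, the update is $x^{(i+1)} = x^{(i)} - \eta \nabla f_i(x^{(i)})$, and $\eta = R/(G\sqrt{T})$.

```latex
\begin{align*}
f_i(x^{(i)}) - f_i(x^*)
  &\le \nabla f_i(x^{(i)})^\top \bigl(x^{(i)} - x^*\bigr)
     && \text{(convexity)} \\
  &= \frac{\|x^{(i)} - x^*\|_2^2 - \|x^{(i+1)} - x^*\|_2^2}{2\eta}
     + \frac{\eta}{2}\,\|\nabla f_i(x^{(i)})\|_2^2
     && \text{(expand the update)} \\[4pt]
\sum_{i=1}^{T} f_i(x^{(i)}) - \sum_{i=1}^{T} f_i(x^*)
  &\le \frac{R^2}{2\eta} + \frac{\eta\, T G^2}{2}
     && \text{(telescope, Lipschitz)} \\
  &= R\,G\,\sqrt{T}
     && \text{(with } \eta = R/(G\sqrt{T})\text{)}
\end{align*}
```

Dividing by $T$, the average regret is at most $RG/\sqrt{T}$, which goes to zero as $T$ grows.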
#incomplete